A K-Means Based Clustering Approach for Finding Faulty Modules in Open Source Software Systems

نویسندگان

  • Parvinder S. Sandhu
  • Jagdeep Singh
  • Vikas Gupta
  • Mandeep Kaur
  • Sonia Manhas
  • Ramandeep Sidhu
چکیده

Prediction of fault-prone modules provides one way to support software quality engineering. Clustering is used to determine the intrinsic grouping in a set of unlabeled data. Among various clustering techniques available in literature K-Means clustering approach is most widely being used. This paper introduces K-Means based Clustering approach for software finding the fault proneness of the Object-Oriented systems. The contribution of this paper is that it has used Metric values of JEdit open source software for generation of the rules for the categorization of software modules in the categories of Faulty and non faulty modules and thereafter empirically validation is performed. The results are measured in terms of accuracy of prediction, probability of Detection and Probability of False Alarms. Keywords—K-Means, Software Fault, Classification, Object Oriented Metrics.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of Fault Proneness of Modules in Open Source Software Systems Using k-NN Clustering

Fault-proneness of a software module is the probability that the module contains faults. A correlation exists between the fault-proneness of the software and the measurable attributes of the code (i.e. the static metrics) and of the testing (i.e. the dynamic metrics). Early detection of fault-prone software components enables verification experts to concentrate their time and resources on the p...

متن کامل

Genetic Algorithm Based Approach for Finding Faulty Modules in Open Source Software Systems

Computer program produces an incorrect or unexpected result or behaves in haphazard way then there is an error in that computer program. In order to improve the software quality, prediction of faulty modules is necessary. Various Metric suites and techniques are available to predict the modules which are critical and likely to be fault prone. Genetic Algorithm is a problem solving algorithm. It...

متن کامل

An Approach to Early Fault Prediction in Software Systems Using K- Means Clustering

Quality of a software component can be measured in terms of fault proneness of data. Quality estimations are made using fault proneness data available from previously developed similar type of projects and the training data consisting of software measurements. To predict faulty modules in software data different techniques have been proposed which includes statistical method, machine learning m...

متن کامل

Analysis of Software Fault and Defect Prediction by Fuzzy C-Means Clustering and Adaptive Neuro Fuzzy C-Means Clustering

Faults are related to failures and they do not have much power for indicating a higher quality or a better system above the baseline that the end-users expect.The system faults are the defects that brim in executable files. Conventional approaches employ the experts to navigate directly into the source code errors. However expansion in system size grew the complexity of task exponentially and g...

متن کامل

A Clustering Based Location-allocation Problem Considering Transportation Costs and Statistical Properties (RESEARCH NOTE)

Cluster analysis is a useful technique in multivariate statistical analysis. Different types of hierarchical cluster analysis and K-means have been used for data analysis in previous studies. However, the K-means algorithm can be improved using some metaheuristics algorithms. In this study, we propose simulated annealing based algorithm for K-means in the clustering analysis which we refer it a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012